-
Notifications
You must be signed in to change notification settings - Fork 423
Fix build and update to CUDA 12.4 #1925
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@jordimas PR #1905 was solving the same issue. I also explained all the error details there. Given that you're the new admin, it is suspicious the fact that you ignored that PR (the only green CI for months) and decided to create a duplicate PR. 😡 You did not event leave a comment or mention that PR 👎 |
|
How does this interplay with the fact that one loads CTranslate2 from a docker image where CUDA runtime installed is >= 12.4? Why do we compile against 12.4 still and also how does the |
|
Hello. None of the changes that I did impacts the Docker container. It compiles and runs still with 12.2 and works fine. If you have observed any problem please share the details. Here there is a new Dockerfile that supports Cuda 12.4: I will appreciate if you can test it. |
|
Probably @ozancaglayan implies that |
|
Sorry for being unclear:
But in this repo and related to this PR, I see that there's an explicit compilation stage which was using CUDA 12.2 and now is updated to CUDA 12.4. Probably all minor releases of CUDA 12 are compatible and one can switch to a newer one at runtime, right? If we were to try CUDA-13 for example, what would be the steps for that? |
|
What about this issue [int8 doesn't work on 50xx GPUs]: #1865 |
|
Hello @ozancaglayan I am not familiar with CUDA-18 runtime, to my knowledge the latest version is 13. |
|
Sorry I meant CUDA 12.8 🤦 |
|
When it comes to CUDA compatibility, review the following for full compatibility issues/conflicts. I'm also including compatibility regarding other well-known libraries like flash attention 2, triton, etc, since most programs typically use all of the above. any way to make ctranslate2 more compatible?? NOTE, I only do Windows since that's all I can test...but shouldn't be too hard to pull similar info for Linux users: |
I give you credit man. I've notice that sometimes people contribute their free time to create a PR only to have someone create another PR that's 95% similar all on their own...and then no recognition. At the same time, glad that the peeps at Ctranslate2 are finally letting someone actually update things for this great repository...and I'm sure @jordimas (who I've communicated with before, good guy) isn't getting paid to do this sort of this for the repo either so...lol. |
Changes: